Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 2125 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 12 |
| Duplicate rows (%) | 0.6% |
| Total size in memory | 249.1 KiB |
| Average record size in memory | 120.1 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 5 |
| Dataset has 12 (0.6%) duplicate rows | Duplicates |
name has a high cardinality: 451 distinct values | High cardinality |
host_name has a high cardinality: 201 distinct values | High cardinality |
last_review has a high cardinality: 308 distinct values | High cardinality |
id is highly correlated with host_id and 1 other fields | High correlation |
host_id is highly correlated with id | High correlation |
latitude is highly correlated with longitude | High correlation |
longitude is highly correlated with latitude | High correlation |
number_of_reviews is highly correlated with id and 1 other fields | High correlation |
reviews_per_month is highly correlated with number_of_reviews | High correlation |
id is highly correlated with host_id and 1 other fields | High correlation |
host_id is highly correlated with id | High correlation |
latitude is highly correlated with longitude | High correlation |
longitude is highly correlated with latitude | High correlation |
number_of_reviews is highly correlated with id and 1 other fields | High correlation |
reviews_per_month is highly correlated with number_of_reviews | High correlation |
number_of_reviews is highly correlated with reviews_per_month | High correlation |
reviews_per_month is highly correlated with number_of_reviews | High correlation |
longitude is highly correlated with neighbourhood and 4 other fields | High correlation |
price is highly correlated with room_type | High correlation |
number_of_reviews is highly correlated with latitude and 2 other fields | High correlation |
neighbourhood is highly correlated with longitude and 4 other fields | High correlation |
latitude is highly correlated with longitude and 5 other fields | High correlation |
reviews_per_month is highly correlated with number_of_reviews and 3 other fields | High correlation |
calculated_host_listings_count is highly correlated with longitude and 2 other fields | High correlation |
room_type is highly correlated with price | High correlation |
id is highly correlated with longitude and 5 other fields | High correlation |
host_id is highly correlated with longitude and 5 other fields | High correlation |
availability_365 has 228 (10.7%) zeros | Zeros |
Reproduction
| Analysis started | 2021-06-22 22:02:48.723691 |
|---|---|
| Analysis finished | 2021-06-22 22:03:55.465594 |
| Duration | 1 minute and 6.74 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 431 |
|---|---|
| Distinct (%) | 20.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23491517.6 |
| Minimum | 8521 |
|---|---|
| Maximum | 48157277 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 8521 |
|---|---|
| 5-th percentile | 1799157 |
| Q1 | 13169945 |
| median | 22888033 |
| Q3 | 33719501 |
| 95-th percentile | 45763073.4 |
| Maximum | 48157277 |
| Range | 48148756 |
| Interquartile range (IQR) | 20549556 |
Descriptive statistics
| Standard deviation | 13795942.02 |
|---|---|
| Coefficient of variation (CV) | 0.5872733407 |
| Kurtosis | -1.092730333 |
| Mean | 23491517.6 |
| Median Absolute Deviation (MAD) | 10398951 |
| Skewness | 0.06111615073 |
| Sum | 4.99194749 × 1010 |
| Variance | 1.903280161 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12318450 | 9 | 0.4% |
| 1225831 | 9 | 0.4% |
| 19346242 | 9 | 0.4% |
| 33245746 | 9 | 0.4% |
| 34664077 | 9 | 0.4% |
| 34944649 | 9 | 0.4% |
| 32986957 | 9 | 0.4% |
| 2538983 | 9 | 0.4% |
| 17285749 | 9 | 0.4% |
| 6185544 | 9 | 0.4% |
| Other values (421) | 2035 |
| Value | Count | Frequency (%) |
| 8521 | 7 | |
| 79762 | 6 | |
| 108898 | 6 | |
| 456429 | 6 | |
| 577384 | 5 | |
| 715532 | 1 | < 0.1% |
| 742574 | 7 | |
| 1140201 | 4 | |
| 1141088 | 6 | |
| 1154298 | 6 |
| Value | Count | Frequency (%) |
| 48157277 | 1 | < 0.1% |
| 48108594 | 2 | |
| 48107460 | 1 | < 0.1% |
| 47703691 | 1 | < 0.1% |
| 47538101 | 3 | |
| 47296747 | 3 | |
| 47223523 | 2 | |
| 46938818 | 3 | |
| 46903645 | 1 | < 0.1% |
| 46820093 | 2 |
| Distinct | 451 |
|---|---|
| Distinct (%) | 21.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.7 KiB |
| 3 Bed 2 Bath Tourist House near Kendall Square | 12 |
|---|---|
| CLOSE TO HARVARD&MIT | 10 |
| keyless private room near MIT,Central Sq 2 | 9 |
| City Oasis |Deck & Yard |Walk To Harvard MIT Train | 9 |
| Harvard MIT: Artist's Home | 9 |
| Other values (446) |
Length
| Max length | 70 |
|---|---|
| Median length | 42 |
| Mean length | 40.36705882 |
| Min length | 14 |
Characters and Unicode
| Total characters | 85780 |
|---|---|
| Distinct characters | 88 |
| Distinct categories | 13 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 5 ? |
Unique
| Unique | 59 ? |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | Middle Room in Shared Apt |
|---|---|
| 2nd row | Large Downstairs Room |
| 3rd row | Victorian Charm MIT/Harvard/Kendall/Central-1BR |
| 4th row | Harvard and MIT - Enjoy Comfort and Convenience! |
| 5th row | Charming Harvard Victorian |
Common Values
| Value | Count | Frequency (%) |
| 3 Bed 2 Bath Tourist House near Kendall Square | 12 | 0.6% |
| CLOSE TO HARVARD&MIT | 10 | 0.5% |
| keyless private room near MIT,Central Sq 2 | 9 | 0.4% |
| City Oasis |Deck & Yard |Walk To Harvard MIT Train | 9 | 0.4% |
| Harvard MIT: Artist's Home | 9 | 0.4% |
| I3 Private Room by Kendall/MIT/Central Statio | 9 | 0.4% |
| Hey Private Room close to MIT and Harvard Uni | 9 | 0.4% |
| Fabulous Flat Near Harvard Square | 9 | 0.4% |
| Convenient Studio *parking* 3-min. walk to subway | 9 | 0.4% |
| Luxury studio w/ parking by MIT/Harvard/BU/Fenway | 9 | 0.4% |
| Other values (441) | 2031 |
Length
| Value | Count | Frequency (%) |
| harvard | 714 | 5.2% |
| 513 | 3.7% | |
| room | 482 | 3.5% |
| to | 469 | 3.4% |
| in | 406 | 2.9% |
| cambridge | 396 | 2.9% |
| mit | 377 | 2.7% |
| private | 371 | 2.7% |
| near | 362 | 2.6% |
| square | 307 | 2.2% |
| Other values (588) | 9461 |
Most occurring characters
| Value | Count | Frequency (%) |
| 11867 | 13.8% | |
| a | 6542 | 7.6% |
| r | 6057 | 7.1% |
| e | 5459 | 6.4% |
| o | 4158 | 4.8% |
| t | 3884 | 4.5% |
| n | 3282 | 3.8% |
| i | 3260 | 3.8% |
| d | 2676 | 3.1% |
| l | 1886 | 2.2% |
| Other values (78) | 36709 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 52145 | |
| Uppercase Letter | 16706 | 19.5% |
| Space Separator | 11867 | 13.8% |
| Other Punctuation | 2681 | 3.1% |
| Decimal Number | 1394 | 1.6% |
| Dash Punctuation | 583 | 0.7% |
| Math Symbol | 130 | 0.2% |
| Other Letter | 86 | 0.1% |
| Open Punctuation | 84 | 0.1% |
| Close Punctuation | 84 | 0.1% |
| Other values (3) | 20 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6542 | |
| r | 6057 | |
| e | 5459 | |
| o | 4158 | 8.0% |
| t | 3884 | 7.4% |
| n | 3282 | 6.3% |
| i | 3260 | 6.3% |
| d | 2676 | 5.1% |
| l | 1886 | 3.6% |
| m | 1770 | 3.4% |
| Other values (16) | 13171 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1503 | 9.0% |
| T | 1448 | 8.7% |
| R | 1335 | 8.0% |
| C | 1290 | 7.7% |
| M | 1252 | 7.5% |
| S | 1239 | 7.4% |
| I | 1165 | 7.0% |
| B | 1161 | 6.9% |
| A | 1019 | 6.1% |
| P | 671 | 4.0% |
| Other values (15) | 4623 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 998 | |
| , | 693 | |
| & | 328 | 12.2% |
| . | 200 | 7.5% |
| ! | 158 | 5.9% |
| * | 91 | 3.4% |
| # | 56 | 2.1% |
| : | 53 | 2.0% |
| ' | 44 | 1.6% |
| @ | 42 | 1.6% |
| Other values (2) | 18 | 0.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 429 | |
| 1 | 428 | |
| 3 | 297 | |
| 4 | 106 | 7.6% |
| 5 | 53 | 3.8% |
| 0 | 25 | 1.8% |
| 9 | 21 | 1.5% |
| 6 | 20 | 1.4% |
| 7 | 9 | 0.6% |
| 8 | 6 | 0.4% |
Other Letter
| Value | Count | Frequency (%) |
| 走 | 16 | |
| 路 | 16 | |
| 到 | 16 | |
| 哈 | 16 | |
| 佛 | 16 | |
| 二 | 6 | 7.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 78 | |
| | | 52 |
Space Separator
| Value | Count | Frequency (%) |
| 11867 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 583 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 84 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 84 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 8 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 |
Other Symbol
| Value | Count | Frequency (%) |
| ❤ | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 68851 | |
| Common | 16843 | 19.6% |
| Han | 86 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6542 | 9.5% |
| r | 6057 | 8.8% |
| e | 5459 | 7.9% |
| o | 4158 | 6.0% |
| t | 3884 | 5.6% |
| n | 3282 | 4.8% |
| i | 3260 | 4.7% |
| d | 2676 | 3.9% |
| l | 1886 | 2.7% |
| m | 1770 | 2.6% |
| Other values (41) | 29877 |
Common
| Value | Count | Frequency (%) |
| 11867 | ||
| / | 998 | 5.9% |
| , | 693 | 4.1% |
| - | 583 | 3.5% |
| 2 | 429 | 2.5% |
| 1 | 428 | 2.5% |
| & | 328 | 1.9% |
| 3 | 297 | 1.8% |
| . | 200 | 1.2% |
| ! | 158 | 0.9% |
| Other values (21) | 862 | 5.1% |
Han
| Value | Count | Frequency (%) |
| 走 | 16 | |
| 路 | 16 | |
| 到 | 16 | |
| 哈 | 16 | |
| 佛 | 16 | |
| 二 | 6 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85676 | |
| CJK | 86 | 0.1% |
| Punctuation | 6 | < 0.1% |
| Dingbats | 6 | < 0.1% |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 11867 | 13.9% | |
| a | 6542 | 7.6% |
| r | 6057 | 7.1% |
| e | 5459 | 6.4% |
| o | 4158 | 4.9% |
| t | 3884 | 4.5% |
| n | 3282 | 3.8% |
| i | 3260 | 3.8% |
| d | 2676 | 3.1% |
| l | 1886 | 2.2% |
| Other values (69) | 36605 |
CJK
| Value | Count | Frequency (%) |
| 走 | 16 | |
| 路 | 16 | |
| 到 | 16 | |
| 哈 | 16 | |
| 佛 | 16 | |
| 二 | 6 | 7.0% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 |
Dingbats
| Value | Count | Frequency (%) |
| ❤ | 6 |
None
| Value | Count | Frequency (%) |
| : | 6 |
| Distinct | 223 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 91269292.18 |
| Minimum | 35384 |
|---|---|
| Maximum | 379297950 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 35384 |
|---|---|
| 5-th percentile | 430015 |
| Q1 | 12576232 |
| median | 43450256 |
| Q3 | 137754684 |
| 95-th percentile | 347407976 |
| Maximum | 379297950 |
| Range | 379262566 |
| Interquartile range (IQR) | 125178452 |
Descriptive statistics
| Standard deviation | 106605849.1 |
|---|---|
| Coefficient of variation (CV) | 1.16803633 |
| Kurtosis | 0.6280525953 |
| Mean | 91269292.18 |
| Median Absolute Deviation (MAD) | 38872190 |
| Skewness | 1.310243033 |
| Sum | 1.939472459 × 1011 |
| Variance | 1.136480706 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15154687 | 87 | 4.1% |
| 43450256 | 86 | 4.0% |
| 373675137 | 57 | 2.7% |
| 21631889 | 54 | 2.5% |
| 81038 | 38 | 1.8% |
| 347407976 | 36 | 1.7% |
| 66604416 | 35 | 1.6% |
| 21745230 | 35 | 1.6% |
| 119672800 | 34 | 1.6% |
| 93503221 | 27 | 1.3% |
| Other values (213) | 1636 |
| Value | Count | Frequency (%) |
| 35384 | 6 | 0.3% |
| 81038 | 38 | |
| 93861 | 6 | 0.3% |
| 229956 | 25 | |
| 306681 | 13 | 0.6% |
| 393304 | 3 | 0.1% |
| 404360 | 6 | 0.3% |
| 405341 | 6 | 0.3% |
| 430015 | 12 | 0.6% |
| 886226 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 379297950 | 3 | 0.1% |
| 377541652 | 5 | 0.2% |
| 374663992 | 1 | < 0.1% |
| 374060072 | 5 | 0.2% |
| 373675137 | 57 | |
| 369560040 | 7 | 0.3% |
| 365311308 | 7 | 0.3% |
| 364585835 | 18 | 0.8% |
| 351555234 | 1 | < 0.1% |
| 347407976 | 36 |
| Distinct | 201 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.7 KiB |
| John | 124 |
|---|---|
| Steve | 95 |
| Liya | 57 |
| Ling Yi | 54 |
| Louisa | 38 |
| Other values (196) |
Length
| Max length | 27 |
|---|---|
| Median length | 5 |
| Mean length | 5.976941176 |
| Min length | 1 |
Characters and Unicode
| Total characters | 12701 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Adam |
|---|---|
| 2nd row | Adam |
| 3rd row | Paul |
| 4th row | Kyle |
| 5th row | Steve |
Common Values
| Value | Count | Frequency (%) |
| John | 124 | 5.8% |
| Steve | 95 | 4.5% |
| Liya | 57 | 2.7% |
| Ling Yi | 54 | 2.5% |
| Louisa | 38 | 1.8% |
| Alexander | 36 | 1.7% |
| Jurek | 35 | 1.6% |
| Toby & Quinn | 35 | 1.6% |
| Charlie | 34 | 1.6% |
| Mark | 33 | 1.6% |
| Other values (191) | 1584 |
Length
| Value | Count | Frequency (%) |
| john | 124 | 4.9% |
| 110 | 4.4% | |
| steve | 102 | 4.0% |
| liya | 57 | 2.3% |
| yi | 54 | 2.1% |
| ling | 54 | 2.1% |
| louisa | 38 | 1.5% |
| alexander | 36 | 1.4% |
| toby | 35 | 1.4% |
| jurek | 35 | 1.4% |
| Other values (207) | 1880 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1373 | 10.8% |
| e | 1343 | 10.6% |
| n | 1188 | 9.4% |
| i | 951 | 7.5% |
| r | 617 | 4.9% |
| l | 568 | 4.5% |
| o | 526 | 4.1% |
| J | 404 | 3.2% |
| 404 | 3.2% | |
| h | 374 | 2.9% |
| Other values (48) | 4953 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9696 | |
| Uppercase Letter | 2446 | 19.3% |
| Space Separator | 404 | 3.2% |
| Other Punctuation | 147 | 1.2% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1373 | |
| e | 1343 | |
| n | 1188 | |
| i | 951 | |
| r | 617 | 6.4% |
| l | 568 | 5.9% |
| o | 526 | 5.4% |
| h | 374 | 3.9% |
| u | 354 | 3.7% |
| y | 326 | 3.4% |
| Other values (17) | 2076 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 404 | |
| A | 250 | |
| L | 232 | |
| M | 228 | |
| S | 165 | 6.7% |
| C | 143 | 5.8% |
| D | 131 | 5.4% |
| R | 122 | 5.0% |
| K | 115 | 4.7% |
| G | 104 | 4.3% |
| Other values (16) | 552 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 137 | |
| / | 10 | 6.8% |
Space Separator
| Value | Count | Frequency (%) |
| 404 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12142 | |
| Common | 559 | 4.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1373 | 11.3% |
| e | 1343 | 11.1% |
| n | 1188 | 9.8% |
| i | 951 | 7.8% |
| r | 617 | 5.1% |
| l | 568 | 4.7% |
| o | 526 | 4.3% |
| J | 404 | 3.3% |
| h | 374 | 3.1% |
| u | 354 | 2.9% |
| Other values (43) | 4444 |
Common
| Value | Count | Frequency (%) |
| 404 | ||
| & | 137 | 24.5% |
| / | 10 | 1.8% |
| ( | 4 | 0.7% |
| ) | 4 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12700 | |
| Latin 1 Sup | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1373 | 10.8% |
| e | 1343 | 10.6% |
| n | 1188 | 9.4% |
| i | 951 | 7.5% |
| r | 617 | 4.9% |
| l | 568 | 4.5% |
| o | 526 | 4.1% |
| J | 404 | 3.2% |
| 404 | 3.2% | |
| h | 374 | 2.9% |
| Other values (47) | 4952 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| ó | 1 |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.7 KiB |
| Cambridgeport | |
|---|---|
| Mid-Cambridge | |
| The Port | |
| North Cambridge | |
| Wellington-Harrington | |
| Other values (8) |
Length
| Max length | 21 |
|---|---|
| Median length | 13 |
| Mean length | 13.44188235 |
| Min length | 7 |
Characters and Unicode
| Total characters | 28564 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | The Port |
|---|---|
| 2nd row | The Port |
| 3rd row | The Port |
| 4th row | Cambridgeport |
| 5th row | West Cambridge |
Common Values
| Value | Count | Frequency (%) |
| Cambridgeport | 327 | |
| Mid-Cambridge | 302 | |
| The Port | 272 | |
| North Cambridge | 226 | |
| Wellington-Harrington | 213 | |
| East Cambridge | 185 | |
| West Cambridge | 155 | |
| Neighborhood Nine | 141 | |
| Riverside | 113 | 5.3% |
| Strawberry Hill | 81 | 3.8% |
| Other values (3) | 110 | 5.2% |
Length
| Value | Count | Frequency (%) |
| cambridge | 572 | |
| cambridgeport | 327 | |
| mid-cambridge | 302 | |
| the | 272 | |
| port | 272 | |
| north | 226 | 7.0% |
| wellington-harrington | 213 | 6.6% |
| east | 185 | 5.7% |
| west | 155 | 4.8% |
| nine | 141 | 4.4% |
| Other values (8) | 565 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2988 | 10.5% |
| i | 2589 | 9.1% |
| e | 2469 | 8.6% |
| g | 1839 | 6.4% |
| a | 1790 | 6.3% |
| d | 1763 | 6.2% |
| o | 1674 | 5.9% |
| t | 1672 | 5.9% |
| b | 1423 | 5.0% |
| C | 1201 | 4.2% |
| Other values (25) | 9156 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23043 | |
| Uppercase Letter | 3823 | 13.4% |
| Space Separator | 1105 | 3.9% |
| Dash Punctuation | 515 | 1.8% |
| Decimal Number | 39 | 0.1% |
| Other Punctuation | 39 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2988 | |
| i | 2589 | |
| e | 2469 | |
| g | 1839 | |
| a | 1790 | |
| d | 1763 | |
| o | 1674 | |
| t | 1672 | |
| b | 1423 | 6.2% |
| m | 1201 | 5.2% |
| Other values (9) | 3635 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1201 | |
| N | 508 | |
| W | 368 | 9.6% |
| M | 341 | 8.9% |
| T | 311 | 8.1% |
| H | 300 | 7.8% |
| P | 272 | 7.1% |
| E | 185 | 4.8% |
| R | 113 | 3.0% |
| A | 104 | 2.7% |
| Other values (2) | 120 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1105 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 515 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 39 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 39 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26866 | |
| Common | 1698 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2988 | |
| i | 2589 | 9.6% |
| e | 2469 | 9.2% |
| g | 1839 | 6.8% |
| a | 1790 | 6.7% |
| d | 1763 | 6.6% |
| o | 1674 | 6.2% |
| t | 1672 | 6.2% |
| b | 1423 | 5.3% |
| C | 1201 | 4.5% |
| Other values (21) | 7458 |
Common
| Value | Count | Frequency (%) |
| 1105 | ||
| - | 515 | |
| 2 | 39 | 2.3% |
| / | 39 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28564 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2988 | 10.5% |
| i | 2589 | 9.1% |
| e | 2469 | 8.6% |
| g | 1839 | 6.4% |
| a | 1790 | 6.3% |
| d | 1763 | 6.2% |
| o | 1674 | 5.9% |
| t | 1672 | 5.9% |
| b | 1423 | 5.0% |
| C | 1201 | 4.2% |
| Other values (25) | 9156 |
| Distinct | 469 |
|---|---|
| Distinct (%) | 22.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.37307034 |
| Minimum | 42.35564 |
|---|---|
| Maximum | 42.40021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 42.35564 |
|---|---|
| 5-th percentile | 42.35889 |
| Q1 | 42.36608 |
| median | 42.37089 |
| Q3 | 42.37778 |
| 95-th percentile | 42.39384 |
| Maximum | 42.40021 |
| Range | 0.04457 |
| Interquartile range (IQR) | 0.0117 |
Descriptive statistics
| Standard deviation | 0.01010154402 |
|---|---|
| Coefficient of variation (CV) | 0.0002383953757 |
| Kurtosis | -0.2066345378 |
| Mean | 42.37307034 |
| Median Absolute Deviation (MAD) | 0.00528 |
| Skewness | 0.7177496906 |
| Sum | 90042.77448 |
| Variance | 0.0001020411916 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 42.3677 | 32 | 1.5% |
| 42.3695 | 18 | 0.8% |
| 42.36962 | 12 | 0.6% |
| 42.36832 | 12 | 0.6% |
| 42.39467 | 12 | 0.6% |
| 42.37315 | 12 | 0.6% |
| 42.37168 | 12 | 0.6% |
| 42.38768 | 12 | 0.6% |
| 42.38116 | 12 | 0.6% |
| 42.35969 | 11 | 0.5% |
| Other values (459) | 1980 |
| Value | Count | Frequency (%) |
| 42.35564 | 9 | |
| 42.35589 | 6 | |
| 42.35667 | 1 | < 0.1% |
| 42.35692 | 8 | |
| 42.35698 | 6 | |
| 42.35704 | 9 | |
| 42.3574 | 6 | |
| 42.35747 | 2 | 0.1% |
| 42.35756 | 6 | |
| 42.35768 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 42.40021 | 5 | |
| 42.39892 | 1 | < 0.1% |
| 42.39889 | 3 | 0.1% |
| 42.39884 | 1 | < 0.1% |
| 42.39805 | 1 | < 0.1% |
| 42.39764 | 5 | |
| 42.39634 | 8 | |
| 42.39633 | 9 | |
| 42.39612 | 4 | |
| 42.39604 | 7 |
| Distinct | 485 |
|---|---|
| Distinct (%) | 22.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -71.11003003 |
| Minimum | -71.15592 |
|---|---|
| Maximum | -71.06636 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2125 |
| Negative (%) | 100.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | -71.15592 |
|---|---|
| 5-th percentile | -71.14064 |
| Q1 | -71.12288 |
| median | -71.10793 |
| Q3 | -71.09768 |
| 95-th percentile | -71.08283 |
| Maximum | -71.06636 |
| Range | 0.08956 |
| Interquartile range (IQR) | 0.0252 |
Descriptive statistics
| Standard deviation | 0.01768778101 |
|---|---|
| Coefficient of variation (CV) | -0.000248738202 |
| Kurtosis | -0.2239837659 |
| Mean | -71.11003003 |
| Median Absolute Deviation (MAD) | 0.011 |
| Skewness | -0.430160778 |
| Sum | -151108.8138 |
| Variance | 0.0003128575972 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -71.10584 | 32 | 1.5% |
| -71.10983 | 14 | 0.7% |
| -71.10338 | 13 | 0.6% |
| -71.0989 | 12 | 0.6% |
| -71.09921 | 12 | 0.6% |
| -71.1248 | 12 | 0.6% |
| -71.13326 | 11 | 0.5% |
| -71.13276 | 11 | 0.5% |
| -71.10493 | 10 | 0.5% |
| -71.1066 | 10 | 0.5% |
| Other values (475) | 1988 |
| Value | Count | Frequency (%) |
| -71.15592 | 6 | |
| -71.15478 | 2 | 0.1% |
| -71.15448 | 2 | 0.1% |
| -71.1544 | 6 | |
| -71.15405 | 1 | < 0.1% |
| -71.15401 | 1 | < 0.1% |
| -71.15354 | 8 | |
| -71.1535 | 1 | < 0.1% |
| -71.15302 | 6 | |
| -71.15278 | 4 |
| Value | Count | Frequency (%) |
| -71.06636 | 1 | < 0.1% |
| -71.0717 | 6 | |
| -71.07226 | 1 | < 0.1% |
| -71.07265 | 2 | 0.1% |
| -71.07373 | 3 | |
| -71.07707 | 5 | |
| -71.07726 | 6 | |
| -71.07787 | 6 | |
| -71.07849 | 4 | |
| -71.07861 | 6 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.7 KiB |
| Entire home/apt | |
|---|---|
| Private room |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.69270588 |
| Min length | 12 |
Characters and Unicode
| Total characters | 29097 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Private room |
|---|---|
| 2nd row | Private room |
| 3rd row | Entire home/apt |
| 4th row | Entire home/apt |
| 5th row | Entire home/apt |
Common Values
| Value | Count | Frequency (%) |
| Entire home/apt | 1199 | |
| Private room | 926 |
Length
Pie chart
| Value | Count | Frequency (%) |
| entire | 1199 | |
| home/apt | 1199 | |
| room | 926 | |
| private | 926 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3324 | |
| e | 3324 | |
| r | 3051 | |
| o | 3051 | |
| i | 2125 | 7.3% |
| a | 2125 | 7.3% |
| 2125 | 7.3% | |
| m | 2125 | 7.3% |
| E | 1199 | 4.1% |
| n | 1199 | 4.1% |
| Other values (5) | 5449 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23648 | |
| Uppercase Letter | 2125 | 7.3% |
| Space Separator | 2125 | 7.3% |
| Other Punctuation | 1199 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3324 | |
| e | 3324 | |
| r | 3051 | |
| o | 3051 | |
| i | 2125 | |
| a | 2125 | |
| m | 2125 | |
| n | 1199 | 5.1% |
| h | 1199 | 5.1% |
| p | 1199 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1199 | |
| P | 926 |
Space Separator
| Value | Count | Frequency (%) |
| 2125 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1199 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25773 | |
| Common | 3324 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3324 | |
| e | 3324 | |
| r | 3051 | |
| o | 3051 | |
| i | 2125 | |
| a | 2125 | |
| m | 2125 | |
| E | 1199 | 4.7% |
| n | 1199 | 4.7% |
| h | 1199 | 4.7% |
| Other values (3) | 3051 |
Common
| Value | Count | Frequency (%) |
| 2125 | ||
| / | 1199 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29097 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3324 | |
| e | 3324 | |
| r | 3051 | |
| o | 3051 | |
| i | 2125 | 7.3% |
| a | 2125 | 7.3% |
| 2125 | 7.3% | |
| m | 2125 | 7.3% |
| E | 1199 | 4.1% |
| n | 1199 | 4.1% |
| Other values (5) | 5449 |
| Distinct | 267 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 127.9642353 |
| Minimum | 19 |
|---|---|
| Maximum | 950 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 19 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 60 |
| median | 103 |
| Q3 | 162 |
| 95-th percentile | 300 |
| Maximum | 950 |
| Range | 931 |
| Interquartile range (IQR) | 102 |
Descriptive statistics
| Standard deviation | 97.03084353 |
|---|---|
| Coefficient of variation (CV) | 0.7582653333 |
| Kurtosis | 14.81399519 |
| Mean | 127.9642353 |
| Median Absolute Deviation (MAD) | 47 |
| Skewness | 2.798750012 |
| Sum | 271924 |
| Variance | 9414.984596 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 64 | 3.0% |
| 65 | 64 | 3.0% |
| 99 | 47 | 2.2% |
| 110 | 43 | 2.0% |
| 150 | 39 | 1.8% |
| 250 | 37 | 1.7% |
| 55 | 37 | 1.7% |
| 100 | 36 | 1.7% |
| 70 | 33 | 1.6% |
| 125 | 33 | 1.6% |
| Other values (257) | 1692 |
| Value | Count | Frequency (%) |
| 19 | 1 | < 0.1% |
| 23 | 2 | 0.1% |
| 25 | 9 | 0.4% |
| 27 | 3 | 0.1% |
| 28 | 3 | 0.1% |
| 29 | 28 | |
| 30 | 12 | |
| 31 | 4 | 0.2% |
| 32 | 8 | 0.4% |
| 33 | 15 |
| Value | Count | Frequency (%) |
| 950 | 4 | |
| 900 | 2 | 0.1% |
| 650 | 1 | < 0.1% |
| 551 | 1 | < 0.1% |
| 538 | 1 | < 0.1% |
| 509 | 1 | < 0.1% |
| 507 | 1 | < 0.1% |
| 500 | 6 | |
| 485 | 1 | < 0.1% |
| 475 | 6 |
minimum_nights
Real number (ℝ≥0)
| Distinct | 41 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.39670588 |
| Minimum | 1 |
|---|---|
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 9 |
| 95-th percentile | 60 |
| Maximum | 365 |
| Range | 364 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 27.62080619 |
|---|---|
| Coefficient of variation (CV) | 2.228076269 |
| Kurtosis | 55.31757426 |
| Mean | 12.39670588 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 5.980811417 |
| Sum | 26343 |
| Variance | 762.9089345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 760 | |
| 2 | 491 | |
| 3 | 207 | 9.7% |
| 30 | 165 | 7.8% |
| 32 | 73 | 3.4% |
| 4 | 65 | 3.1% |
| 28 | 45 | 2.1% |
| 5 | 35 | 1.6% |
| 31 | 27 | 1.3% |
| 60 | 26 | 1.2% |
| Other values (31) | 231 | 10.9% |
| Value | Count | Frequency (%) |
| 1 | 760 | |
| 2 | 491 | |
| 3 | 207 | 9.7% |
| 4 | 65 | 3.1% |
| 5 | 35 | 1.6% |
| 6 | 8 | 0.4% |
| 7 | 25 | 1.2% |
| 9 | 6 | 0.3% |
| 10 | 7 | 0.3% |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 365 | 3 | 0.1% |
| 300 | 2 | 0.1% |
| 200 | 3 | 0.1% |
| 180 | 6 | 0.3% |
| 145 | 2 | 0.1% |
| 110 | 4 | 0.2% |
| 100 | 8 | 0.4% |
| 95 | 1 | < 0.1% |
| 91 | 22 | |
| 90 | 11 |
number_of_reviews
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 296 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88.55576471 |
| Minimum | 1 |
|---|---|
| Maximum | 588 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 12 |
| median | 54 |
| Q3 | 139 |
| 95-th percentile | 280 |
| Maximum | 588 |
| Range | 587 |
| Interquartile range (IQR) | 127 |
Descriptive statistics
| Standard deviation | 97.91212729 |
|---|---|
| Coefficient of variation (CV) | 1.105655037 |
| Kurtosis | 3.32078882 |
| Mean | 88.55576471 |
| Median Absolute Deviation (MAD) | 50 |
| Skewness | 1.672521384 |
| Sum | 188181 |
| Variance | 9586.784671 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 171 | 8.0% |
| 2 | 85 | 4.0% |
| 3 | 61 | 2.9% |
| 4 | 46 | 2.2% |
| 9 | 33 | 1.6% |
| 30 | 32 | 1.5% |
| 20 | 31 | 1.5% |
| 10 | 29 | 1.4% |
| 18 | 26 | 1.2% |
| 5 | 24 | 1.1% |
| Other values (286) | 1587 |
| Value | Count | Frequency (%) |
| 1 | 171 | |
| 2 | 85 | |
| 3 | 61 | 2.9% |
| 4 | 46 | 2.2% |
| 5 | 24 | 1.1% |
| 6 | 23 | 1.1% |
| 7 | 15 | 0.7% |
| 8 | 10 | 0.5% |
| 9 | 33 | 1.6% |
| 10 | 29 | 1.4% |
| Value | Count | Frequency (%) |
| 588 | 4 | |
| 586 | 2 | 0.1% |
| 513 | 1 | < 0.1% |
| 500 | 1 | < 0.1% |
| 488 | 1 | < 0.1% |
| 476 | 1 | < 0.1% |
| 462 | 1 | < 0.1% |
| 449 | 1 | < 0.1% |
| 440 | 6 | |
| 432 | 8 |
| Distinct | 308 |
|---|---|
| Distinct (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.7 KiB |
| 2020-08-31 | 37 |
|---|---|
| 2020-11-22 | 28 |
| 2020-11-01 | 28 |
| 2020-05-31 | 27 |
| 2020-10-01 | 26 |
| Other values (303) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 21250 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 42 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | 2020-04-24 |
|---|---|
| 2nd row | 2020-04-01 |
| 3rd row | 2020-04-02 |
| 4th row | 2020-04-10 |
| 5th row | 2020-04-02 |
Common Values
| Value | Count | Frequency (%) |
| 2020-08-31 | 37 | 1.7% |
| 2020-11-22 | 28 | 1.3% |
| 2020-11-01 | 28 | 1.3% |
| 2020-05-31 | 27 | 1.3% |
| 2020-10-01 | 26 | 1.2% |
| 2020-10-12 | 26 | 1.2% |
| 2020-11-21 | 25 | 1.2% |
| 2020-10-25 | 25 | 1.2% |
| 2020-04-01 | 24 | 1.1% |
| 2021-01-31 | 22 | 1.0% |
| Other values (298) | 1857 |
Length
| Value | Count | Frequency (%) |
| 2020-08-31 | 37 | 1.7% |
| 2020-11-22 | 28 | 1.3% |
| 2020-11-01 | 28 | 1.3% |
| 2020-05-31 | 27 | 1.3% |
| 2020-10-01 | 26 | 1.2% |
| 2020-10-12 | 26 | 1.2% |
| 2020-11-21 | 25 | 1.2% |
| 2020-10-25 | 25 | 1.2% |
| 2020-04-01 | 24 | 1.1% |
| 2021-01-31 | 22 | 1.0% |
| Other values (298) | 1857 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6197 | |
| 2 | 5435 | |
| - | 4250 | |
| 1 | 2946 | |
| 3 | 488 | 2.3% |
| 5 | 396 | 1.9% |
| 9 | 375 | 1.8% |
| 4 | 354 | 1.7% |
| 8 | 329 | 1.5% |
| 6 | 265 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17000 | |
| Dash Punctuation | 4250 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6197 | |
| 2 | 5435 | |
| 1 | 2946 | |
| 3 | 488 | 2.9% |
| 5 | 396 | 2.3% |
| 9 | 375 | 2.2% |
| 4 | 354 | 2.1% |
| 8 | 329 | 1.9% |
| 6 | 265 | 1.6% |
| 7 | 215 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4250 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21250 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6197 | |
| 2 | 5435 | |
| - | 4250 | |
| 1 | 2946 | |
| 3 | 488 | 2.3% |
| 5 | 396 | 1.9% |
| 9 | 375 | 1.8% |
| 4 | 354 | 1.7% |
| 8 | 329 | 1.5% |
| 6 | 265 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21250 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6197 | |
| 2 | 5435 | |
| - | 4250 | |
| 1 | 2946 | |
| 3 | 488 | 2.3% |
| 5 | 396 | 1.9% |
| 9 | 375 | 1.8% |
| 4 | 354 | 1.7% |
| 8 | 329 | 1.5% |
| 6 | 265 | 1.2% |
reviews_per_month
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 612 |
|---|---|
| Distinct (%) | 28.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.325327059 |
| Minimum | 0.06 |
|---|---|
| Maximum | 10.86 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 0.202 |
| Q1 | 0.66 |
| median | 1.89 |
| Q3 | 3.51 |
| 95-th percentile | 6.238 |
| Maximum | 10.86 |
| Range | 10.8 |
| Interquartile range (IQR) | 2.85 |
Descriptive statistics
| Standard deviation | 1.967551977 |
|---|---|
| Coefficient of variation (CV) | 0.8461398878 |
| Kurtosis | 1.435349074 |
| Mean | 2.325327059 |
| Median Absolute Deviation (MAD) | 1.34 |
| Skewness | 1.182614222 |
| Sum | 4941.32 |
| Variance | 3.87126078 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47 | 2.2% |
| 0.2 | 21 | 1.0% |
| 0.33 | 18 | 0.8% |
| 0.25 | 16 | 0.8% |
| 0.37 | 15 | 0.7% |
| 2 | 15 | 0.7% |
| 0.43 | 15 | 0.7% |
| 0.17 | 14 | 0.7% |
| 0.19 | 13 | 0.6% |
| 0.48 | 13 | 0.6% |
| Other values (602) | 1938 |
| Value | Count | Frequency (%) |
| 0.06 | 1 | < 0.1% |
| 0.07 | 2 | 0.1% |
| 0.08 | 1 | < 0.1% |
| 0.1 | 4 | 0.2% |
| 0.11 | 5 | |
| 0.12 | 7 | |
| 0.13 | 4 | 0.2% |
| 0.14 | 7 | |
| 0.15 | 7 | |
| 0.16 | 10 |
| Value | Count | Frequency (%) |
| 10.86 | 1 | |
| 10.45 | 1 | |
| 10.38 | 2 | |
| 10.22 | 1 | |
| 10.21 | 1 | |
| 10.15 | 1 | |
| 10.14 | 1 | |
| 10.13 | 1 | |
| 10 | 1 | |
| 9.98 | 1 |
| Distinct | 25 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.265411765 |
| Minimum | 1 |
|---|---|
| Maximum | 41 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 17 |
| Maximum | 41 |
| Range | 40 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 5.976582825 |
|---|---|
| Coefficient of variation (CV) | 1.135064662 |
| Kurtosis | 6.026209649 |
| Mean | 5.265411765 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.213838707 |
| Sum | 11189 |
| Variance | 35.71954226 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 600 | |
| 3 | 366 | |
| 2 | 256 | |
| 4 | 233 | 11.0% |
| 5 | 139 | 6.5% |
| 17 | 82 | 3.9% |
| 13 | 78 | 3.7% |
| 6 | 59 | 2.8% |
| 9 | 53 | 2.5% |
| 15 | 45 | 2.1% |
| Other values (15) | 214 | 10.1% |
| Value | Count | Frequency (%) |
| 1 | 600 | |
| 2 | 256 | |
| 3 | 366 | |
| 4 | 233 | 11.0% |
| 5 | 139 | 6.5% |
| 6 | 59 | 2.8% |
| 7 | 26 | 1.2% |
| 8 | 27 | 1.3% |
| 9 | 53 | 2.5% |
| 10 | 12 | 0.6% |
| Value | Count | Frequency (%) |
| 41 | 6 | 0.3% |
| 36 | 1 | < 0.1% |
| 35 | 3 | 0.1% |
| 33 | 3 | 0.1% |
| 29 | 8 | 0.4% |
| 23 | 26 | 1.2% |
| 22 | 14 | 0.7% |
| 18 | 13 | 0.6% |
| 17 | 82 | |
| 16 | 26 | 1.2% |
| Distinct | 354 |
|---|---|
| Distinct (%) | 16.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 173.1416471 |
| Minimum | 0 |
|---|---|
| Maximum | 365 |
| Zeros | 228 |
| Zeros (%) | 10.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 56 |
| median | 159 |
| Q3 | 300 |
| 95-th percentile | 365 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 244 |
Descriptive statistics
| Standard deviation | 129.5564728 |
|---|---|
| Coefficient of variation (CV) | 0.7482686864 |
| Kurtosis | -1.408486453 |
| Mean | 173.1416471 |
| Median Absolute Deviation (MAD) | 123 |
| Skewness | 0.1412735602 |
| Sum | 367926 |
| Variance | 16784.87964 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 228 | 10.7% |
| 365 | 133 | 6.3% |
| 1 | 60 | 2.8% |
| 180 | 41 | 1.9% |
| 364 | 35 | 1.6% |
| 179 | 27 | 1.3% |
| 363 | 26 | 1.2% |
| 362 | 25 | 1.2% |
| 90 | 24 | 1.1% |
| 147 | 21 | 1.0% |
| Other values (344) | 1505 |
| Value | Count | Frequency (%) |
| 0 | 228 | |
| 1 | 60 | 2.8% |
| 2 | 10 | 0.5% |
| 3 | 11 | 0.5% |
| 4 | 3 | 0.1% |
| 5 | 4 | 0.2% |
| 6 | 7 | 0.3% |
| 7 | 8 | 0.4% |
| 8 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 365 | 133 | |
| 364 | 35 | 1.6% |
| 363 | 26 | 1.2% |
| 362 | 25 | 1.2% |
| 361 | 12 | 0.6% |
| 360 | 15 | 0.7% |
| 359 | 15 | 0.7% |
| 358 | 6 | 0.3% |
| 357 | 5 | 0.2% |
| 356 | 6 | 0.3% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1193862 | Middle Room in Shared Apt | 229956 | Adam | The Port | 42.36494 | -71.10054 | Private room | 23 | 2 | 125 | 2020-04-24 | 1.49 | 3 | 4 |
| 1 | 1193875 | Large Downstairs Room | 229956 | Adam | The Port | 42.36433 | -71.09911 | Private room | 33 | 3 | 164 | 2020-04-01 | 1.98 | 3 | 7 |
| 2 | 1225831 | Victorian Charm MIT/Harvard/Kendall/Central-1BR | 3380576 | Paul | The Port | 42.36458 | -71.09845 | Entire home/apt | 155 | 3 | 429 | 2020-04-02 | 5.12 | 1 | 310 |
| 3 | 1307195 | Harvard and MIT - Enjoy Comfort and Convenience! | 7106416 | Kyle | Cambridgeport | 42.36392 | -71.10191 | Entire home/apt | 469 | 2 | 420 | 2020-04-10 | 6.11 | 1 | 68 |
| 4 | 1984737 | Charming Harvard Victorian | 8824696 | Steve | West Cambridge | 42.38116 | -71.13326 | Entire home/apt | 425 | 2 | 293 | 2020-04-02 | 4.07 | 2 | 306 |
| 5 | 2538983 | Spacious 2 bedrooms Apt-Roof deck NO Cleaning fee | 13000172 | Dan | Cambridgeport | 42.35704 | -71.10909 | Entire home/apt | 350 | 1 | 369 | 2020-04-20 | 6.38 | 1 | 231 |
| 6 | 3434822 | Harvard MIT: Artist's Home | 5871398 | Beverly | North Cambridge | 42.39403 | -71.13094 | Private room | 79 | 145 | 30 | 2020-04-16 | 0.42 | 2 | 359 |
| 7 | 3774900 | City Oasis |Deck & Yard |Walk To Harvard MIT Train | 6823717 | Juan Carlos | Cambridgeport | 42.36079 | -71.11365 | Entire home/apt | 170 | 2 | 261 | 2020-04-06 | 3.83 | 1 | 64 |
| 8 | 4956321 | Charming Third Floor Apartment | 25547444 | Susan | Agassiz | 42.38412 | -71.11484 | Entire home/apt | 145 | 4 | 122 | 2020-04-06 | 1.91 | 1 | 212 |
| 9 | 6185544 | Sunny 3 bed/2 bath-5 min to T- Harvard/MIT/Boston | 21745230 | Jurek | North Cambridge | 42.39633 | -71.13642 | Entire home/apt | 250 | 1 | 73 | 2020-04-15 | 1.26 | 6 | 276 |
Last rows
| id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2115 | 46820093 | Marvelous 4 bed 1 bath 1 free parking Harvard SQ | 373675137 | Liya | West Cambridge | 42.37352 | -71.12260 | Entire home/apt | 150 | 2 | 2 | 2021-02-14 | 0.88 | 23 | 155 |
| 2116 | 46903645 | Most popular 3 bed rooms in Harvard Sq | 373675137 | Liya | West Cambridge | 42.37433 | -71.12411 | Entire home/apt | 151 | 2 | 1 | 2021-03-08 | 1.00 | 23 | 230 |
| 2117 | 46938818 | Walk To Harvard | 379297950 | Ismayil | Strawberry Hill | 42.37767 | -71.14900 | Entire home/apt | 119 | 1 | 1 | 2021-01-01 | 0.35 | 1 | 365 |
| 2118 | 47223523 | Entire private studio suite in Harvard/MIT | 45011296 | Lance | Mid-Cambridge | 42.36849 | -71.10537 | Entire home/apt | 295 | 3 | 2 | 2021-02-25 | 1.36 | 4 | 38 |
| 2119 | 47296747 | Central Square - Between Harvard & MIT | 74045635 | Ian | Cambridgeport | 42.36333 | -71.10285 | Private room | 35 | 2 | 3 | 2021-02-09 | 1.41 | 1 | 0 |
| 2120 | 47538101 | SoloPrivate Space | 270651080 | Olan | The Port | 42.37122 | -71.09942 | Entire home/apt | 85 | 2 | 5 | 2021-03-20 | 2.05 | 2 | 52 |
| 2121 | 47703691 | ENTIRE APT: Bright Sun Drenched Spot in Cambridge! | 886226 | Kibbee | Wellington-Harrington | 42.37229 | -71.09891 | Entire home/apt | 85 | 4 | 3 | 2021-03-23 | 3.00 | 1 | 0 |
| 2122 | 48107460 | I2 Private Room by Kendall Sq | 218493228 | John | Wellington-Harrington | 42.36992 | -71.09612 | Private room | 46 | 1 | 2 | 2021-03-09 | 2.00 | 3 | 287 |
| 2123 | 48108594 | Private room next to MIT/Harvard 3 | 247533528 | Roxy | Mid-Cambridge | 42.37185 | -71.10017 | Private room | 47 | 1 | 1 | 2021-02-16 | 0.77 | 3 | 235 |
| 2124 | 48157277 | Rice Street Studio | 374663992 | Amanda | North Cambridge | 42.39563 | -71.12863 | Entire home/apt | 116 | 2 | 2 | 2021-03-21 | 2.00 | 1 | 178 |
Most frequently occurring
| id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7 | 7774831 | Charming house in Cambridge C33W | 12576232 | Thomas | The Port | 42.36946 | -71.09899 | Private room | 45 | 32 | 11 | 2020-11-25 | 0.21 | 5 | 365 | 3 |
| 0 | 1330779 | Huron Village Lower Unit Harvard Sq | 4642626 | Amy | Neighborhood Nine | 42.38905 | -71.12425 | Entire home/apt | 200 | 7 | 65 | 2020-05-31 | 0.72 | 5 | 0 | 2 |
| 1 | 3434822 | Harvard MIT: Artist's Home | 5871398 | Beverly | North Cambridge | 42.39403 | -71.13094 | Private room | 84 | 180 | 30 | 2020-04-16 | 0.37 | 2 | 365 | 2 |
| 2 | 3434822 | Harvard MIT: Artist's Home | 5871398 | Beverly | North Cambridge | 42.39403 | -71.13094 | Private room | 84 | 180 | 30 | 2020-04-16 | 0.38 | 2 | 365 | 2 |
| 3 | 3434822 | Harvard MIT: Artist's Home | 5871398 | Beverly | North Cambridge | 42.39403 | -71.13094 | Private room | 84 | 180 | 30 | 2020-04-16 | 0.39 | 2 | 365 | 2 |
| 4 | 3610778 | Private Room with Bunk Bed near Harvard/MIT | 4297079 | Charlton & Theresa | East Cambridge | 42.36903 | -71.08794 | Private room | 85 | 4 | 28 | 2020-09-25 | 0.35 | 1 | 91 | 2 |
| 5 | 3610778 | Private Room with Bunk Bed near Harvard/MIT | 4297079 | Charlton & Theresa | East Cambridge | 42.36903 | -71.08794 | Private room | 85 | 4 | 28 | 2020-09-25 | 0.36 | 1 | 0 | 2 |
| 6 | 7007117 | Room near Alewife Red Line T stop. | 21745230 | Jurek | North Cambridge | 42.39634 | -71.13683 | Private room | 60 | 1 | 29 | 2020-04-25 | 0.43 | 6 | 365 | 2 |
| 8 | 14213983 | Great Cambridge studio, great Harvard sq location | 1473780 | Bernardo | Riverside | 42.36660 | -71.11397 | Entire home/apt | 135 | 3 | 11 | 2020-11-24 | 0.20 | 1 | 88 | 2 |
| 9 | 14727253 | Charming House in Cambridge C36W | 12576232 | Thomas | The Port | 42.36931 | -71.09760 | Private room | 45 | 32 | 10 | 2020-11-20 | 0.19 | 5 | 365 | 2 |